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(57) Abstract: A clock strategy is 
provided for digital circuits inside 
mixed-signal ICs. An integrated 
circuit in accordance with the present 
invention comprises a plurality of pairs 
of latches (LI, L2) being respectively 
clocked by two non-overlapping clock 
signals (FI, F2).The clock strategy is 
aimed at keeping the substrate bounce 
caused by the digital circuits as low as 
possible. Preferably, not all latches are 
clocked at the same time, but delays are 
inserted in the clock lines so that the 
various latches do not consume current 
all at the same time. The invention 
relaxes the demands on the substrate 
sensitivity of the analog circuits. 
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INTEGRATED CIRCUIT HAVING REDUCED SUBSTRATE BOUNCE 



The invention relates to integrated circuits and, more specifically, to a 
reduction of substrate bounce in digital circuits. 

One of the major problems of digital circuits is the substrate bounce that they 
5 cause. The substrate bounce is a great obstacle when integrating analog circuits in the same 
IC. But, with the ever increasing speed of digital processes, the currents in the digital circuits 
have become so high that they even start to affect the local supply (and hence the 
performance) of the digital circuits themselves. 

10 It is, inter alia, an object of the invention to provide improved integrated 

circuits. To this end, the invention provides an integrated circuit as defined in the 
independent claim. Advantageous embodiments are defined in the dependent claims. 

These and other aspects of the invention will be apparent from and elucidated 
with reference to the embodiments described hereinafter. 

15 

In the drawings: 

Fig. 1 shows a prior art clock tree; 

Fig. 2 shows a prior art edge-triggered flip-flop having two latches; 
Fig. 3 shows an embodiment of a flip-flip having two latches clocked in 
20 accordance with the present invention with two non-overlapping clock phases; 

Fig. 4 shows an embodiment of a clock line for non-simultaneous switching of 
the latches of one of the two clock phases; and 

Fig. 5 shows an embodiment of a clock line in which each buffer drives more 

than one latch. 

25 

In many applications, e.g. microprocessors and memories, the maximum speed 
is absolutely crucial. No compromises on top speed are acceptable if you want to be on the 
cutting edge in those fields. But there are also fields, e.g. one-chip processors for analog 
broadcast TV, where top speed of digital circuits is not the number one priority. But, if the 
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digital circuits are not running at their maximum speed to perform their function, this implies 
they are idling part of the time. This is a waste of resources. The invention is based on the 
recognition that this time can be used to solve the substrate bounce problem. And, if there is 
time to spare, in a preferred embodiment of the present invention another highly annoying 
problem of fast digital circuits is tackled as well: clock skew. 

Fast digital circuits usually have a synchronous clock strategy. This means that 
all flip-flops in the circuit are supposed to switch simultaneously. Using one clock buffer to 
drive the load of all flip-flops in the circuit is not practical. So, instead of one clock buffer a 
clock tree is used, as shown in Fig. 1. The clock tree of Fig. 1 has a plurality of buffers B 
between a clock in Ci and a clock out Co. The clock tree has to be designed in such a way 
that all buffers B that are connected to flip-flops switch at the same time. This clock strategy 
has the advantage that the circuits can be extremely fast, but as all flip-flops switch 
simultaneously, the switching currents are very large. This causes both substrate bounce and 
momentary drops of the supply voltage (which slows the circuit down). And, even if the 
clock tree has been designed very carefully, it is hard to guarantee that some flip-flops will 
not switch later than others under all process/voltage/temperature situations. In other words, 
avoiding clock skew is a tough and time-consuming job. 

Fig. 2 shows the construction of an edge-triggered D flip-flop, the standard 
memory component of the vast majority of digital circuits. It consists of two latches LI, L2 
that are driven from one clock C, but with an inverter I between the two latches' clock inputs. 
This way the latches LI, L2 are never open at the same time. This construction also means 
that the input data Di is passed on to the output Do instantly on the active clock edge. But, as 
the latches are fast but not infinitely fast, they do have so-called set-up and hold tunes. 
During these short time intervals around the active clock edge the input data is not allowed to 
change. If it does, the output data of the flip-flop is not reliable. Clock skew occurs when the 
data of one flip-flop arrives at another flip-flop before the hold time of the second flip-flop 
has elapsed. 

The invention is based on the recognition that clock skew can be avoided by 
using two non-overlapping clock phases *1, $2 instead of one (Fig. 3). The skew 
insensitivity is paid for by a reduction of the maximum clock speed. A wonderful feature of 
this clock strategy is that if all latches of the same clock phase do not open at the same time, 
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this does not influence the performance of the circuit. If we use this feature to purposely open 
the latches LI, L2 at different times, we can reduce the peak current that flows in the circuit 
after a clock transition: the switching current is distributed in time. The fact that the 
switching current is distributed in time means that the substrate bounce caused by the digital 
5 circuit is reduced. How much the reduction is depends on the amount by which the peak 
current (or better: its dl/dt) is reduced. 

Fig. 4 shows how the non-simultaneous switching of the latches can be 
accomplished: drive the clock input of one latch L from a delay circuit t connected to another 

10 latch L of the same phase $ 1 or $2. The digital circuit has two structures like the one shown 
in Fig. 4: a first structure in which the latches L of Fig. 4 correspond to the latches LI of Fig. 
3 which are clocked by the clock signal having the clock phase $ 1 , and a second structure in 
which the latches L of Fig. 4 correspond to the latches L2 of Fig. 3 which are clocked by the 
other clock signal having the other clock phase *2. The delay circuit r can simply be a non- 

1 5 inverting buffer. An inverting buffer is smaller, but then two types of latches are needed for 
each clock phase: active-high and active-low. If the clock inputs of all latches of a circuit 
having many latches are placed in series like this, the end result will be an extremely slow 
circuit. So, a compromise has to be found between using one buffer to clock all latches of the 
same phase in parallel and using as many buffers as there are latches to clock them all in 

20 series. 

The solution for this is to use a clock line for both clock phases (Fig. 5), in 
which each node of the clock line drives a number of latches. The number of latches driven 
by each node in the line is simply N/M, with N being the total number of latches of one clock 
25 phase and M being the number of nodes in the clock line. N is determined by the design. M 
must be chosen such that the clock transient is spread out over the whole clock cycle (in 
worst case conditions). 

To keep the speed of the circuit as high as possible, the latches at inputs of the 
30 longest path of the logic should be clocked with the first buffer of the $2 clock line. The 
output of the longest path should be clocked with the last buffer in the $1 clock line. 

In Fig. 3, * 1 and $2 are drawn as signals with 25% duty cycle and evenly 
spaced with respect to one another. But, as there is no logic path between the two latches that 
make up one flip-flop, $2 may start immediately after $ 1 closes its last latch. In other words: 



WO 03/071681 PCT/IB03/00282 

4 

the clock generator should make $2 using the output of the last buffer in the $ 1 clock line as 
its timing reference. 

A preferred embodiment of the present invention can be summarized as 
5 follows. A clock strategy is provided for digital circuits inside mixed-signal ICs. An 

integrated circuit in accordance with the present invention comprises a plurality of pairs of 
latches LI, L2 being respectively clocked by two non-overlapping clock signals * 1, #2. The 
clock strategy is aimed at keeping the substrate bounce caused by the digital circuits as low 
as possible. Preferably, not all latches are clocked at the same time, but delays are inserted in 

10 the clock line so that the various latches do not consume current all at the same time. The 
invention relaxes the demands on the substrate sensitivity of the analog circuits. 

The present invention offers the following advantages over the prior art: low 
substrate bounce, no clock skew, and a design approach that is identical to design approach 
of "normal" synchronous circuits, i.e. all 'mainstream' design tools can be used. At the end of 

15 the design process the flip-flops are replaced by double latches and clock lines. The clock 
control block (used in synchronous digital ICs to prevent clock skew and enter test modes) 
does not have to take the direction of data flow with respect to the clock into account. The 
lower substrate bounce can be used to reduce on-chip decoupling. Advantageously, the 
measure of the present invention does not necessitate a circuit to contain more flip-flops than 

20 prior art circuits to obtain a current-pulse-spreading in time. Also, the clock lines do not need 
to contain more inverters that the prior art clock tree. The advantages of the present invention 
can thus be obtained without a need for additional circuitry. 

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 

25 embodiments without departing from the scope of the appended claims. The clock signals 
$ 1 , $2 do not need to have the same duty cycle. In the claims, any reference signs placed 
between parentheses shall not be construed as limiting the claim. The word "comprising" 
does not exclude the presence of elements or steps other than those listed in a claim. The 
word "a" or "an" preceding an element does not exclude the presence of a plurality of such 

30 elements. In the device claim enumerating several means, several of these means can be 
embodied by one and the same item of hardware. The mere fact that certain measures are 
recited in mutually different dependent claims does not indicate that a combination of these 
measures cannot be used to advantage. 
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CLAIMS: 



1 . A integrated circuit comprising: 

a plurality of pairs of latches (LI , L2) being respectively clocked by two non- 
overlapping clock signals ($1, $2). 

5 2. A integrated circuit as claimed in claim 1, wherein delay circuits (r) are placed 

between clock inputs of latches (L) of a same clock phase ($ 1 or $2). 

3. A integrated circuit as claimed in claim 2, wherein each delay circuit (B) 

drives more than one latch. 
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